Gulfport
MULTIBENCH++: A Unified and Comprehensive Multimodal Fusion Benchmarking Across Specialized Domains
Xue, Leyan, Zhang, Changqing, Xue, Kecheng, Liu, Xiaohong, Wang, Guangyu, Han, Zongbo
Although multimodal fusion has made significant progress, its advancement is severely hindered by the lack of adequate evaluation benchmarks. Current fusion methods are typically evaluated on a small selection of public datasets, a limited scope that inadequately represents the complexity and diversity of real-world scenarios, potentially leading to biased evaluations. This issue presents a twofold challenge. On one hand, models may overfit to the biases of specific datasets, hindering their generalization to broader practical applications. On the other hand, the absence of a unified evaluation standard makes fair and objective comparisons between different fusion methods difficult. Consequently, a truly universal and high-performance fusion model has yet to emerge. To address these challenges, we have developed a large-scale, domain-adaptive benchmark for multimodal evaluation. This benchmark integrates over 30 datasets, encompassing 15 modalities and 20 predictive tasks across key application domains. To complement this, we have also developed an open-source, unified, and automated evaluation pipeline that includes standardized implementations of state-of-the-art models and diverse fusion paradigms. Leveraging this platform, we have conducted large-scale experiments, successfully establishing new performance baselines across multiple tasks. This work provides the academic community with a crucial platform for rigorous and reproducible assessment of multimodal models, aiming to propel the field of multimodal artificial intelligence to new heights.
- North America > United States > Florida > Alachua County > Gainesville (0.14)
- North America > United States > Texas > Harris County > Houston (0.04)
- Asia > China > Tianjin Province > Tianjin (0.04)
- (10 more...)
- Health & Medicine > Health Care Technology (1.00)
- Health & Medicine > Diagnostic Medicine > Imaging (1.00)
- Health & Medicine > Therapeutic Area > Oncology (0.93)
- Health & Medicine > Therapeutic Area > Neurology (0.93)
3D-Convolution Guided Spectral-Spatial Transformer for Hyperspectral Image Classification
Varahagiri, Shyam, Sinha, Aryaman, Dubey, Shiv Ram, Singh, Satish Kumar
In recent years, Vision Transformers (ViTs) have shown promising classification performance over Convolutional Neural Networks (CNNs) due to their self-attention mechanism. Many researchers have incorporated ViTs for Hyperspectral Image (HSI) classification. HSIs are characterised by narrow contiguous spectral bands, providing rich spectral data. Although ViTs excel with sequential data, they cannot extract spectral-spatial information like CNNs. Furthermore, to have high classification performance, there should be a strong interaction between the HSI token and the class (CLS) token. To solve these issues, we propose a 3D-Convolution guided Spectral-Spatial Transformer (3D-ConvSST) for HSI classification that utilizes a 3D-Convolution Guided Residual Module (CGRM) in-between encoders to "fuse" the local spatial and spectral information and to enhance the feature propagation. Furthermore, we forego the class token and instead apply Global Average Pooling, which effectively encodes more discriminative and pertinent high-level features for classification. Extensive experiments have been conducted on three public HSI datasets to show the superiority of the proposed model over state-of-the-art traditional, convolutional, and Transformer models. The code is available at https://github.com/ShyamVarahagiri/3D-ConvSST.
- Africa > Botswana (0.07)
- North America > United States > Mississippi > Harrison County > Gulfport (0.04)
- Asia > India (0.04)
- Asia > China > Hubei Province > Wuhan (0.04)
Augmented Language Models: a Survey
Mialon, Grégoire, Dessì, Roberto, Lomeli, Maria, Nalmpantis, Christoforos, Pasunuru, Ram, Raileanu, Roberta, Rozière, Baptiste, Schick, Timo, Dwivedi-Yu, Jane, Celikyilmaz, Asli, Grave, Edouard, LeCun, Yann, Scialom, Thomas
This survey reviews works in which language models (LMs) are augmented with reasoning skills and the ability to use tools. The former is defined as decomposing a potentially complex task into simpler subtasks while the latter consists in calling external modules such as a code interpreter. LMs can leverage these augmentations separately or in combination via heuristics, or learn to do so from demonstrations. While adhering to a standard missing tokens prediction objective, such augmented LMs can use various, possibly non-parametric external modules to expand their context processing ability, thus departing from the pure language modeling paradigm. We therefore refer to them as Augmented Language Models (ALMs). The missing token objective allows ALMs to learn to reason, use tools, and even act, while still performing standard natural language tasks and even outperforming most regular LMs on several benchmarks. In this work, after reviewing current advance in ALMs, we conclude that this new research direction has the potential to address common limitations of traditional LMs such as interpretability, consistency, and scalability issues.
- Asia > Middle East > Jordan (0.04)
- Asia > Japan > Honshū > Chūbu > Toyama Prefecture > Toyama (0.04)
- North America > United States > Mississippi > Harrison County > Gulfport (0.04)
- (6 more...)
- Education (1.00)
- Leisure & Entertainment > Games (0.67)
- Information Technology > Services (0.45)
Shared Manifold Learning Using a Triplet Network for Multiple Sensor Translation and Fusion with Missing Data
Dutt, Aditya, Zare, Alina, Gader, Paul
Abstract--Heterogeneous data fusion can enhance the robustness and accuracy of an algorithm on a given task. However, due to the difference in various modalities, aligning the sensors and embedding their information into discriminative and compact representations is challenging. In this paper, we propose a Contrastive learning based MultiModal Alignment Network (CoMMANet) to align data from different sensors into a shared and discriminative manifold where class information is preserved. The proposed architecture uses a multimodal triplet autoencoder to cluster the latent space in such a way that samples of the same classes from each heterogeneous modality are mapped close to each other. Since all the modalities exist in a shared manifold, a unified classification framework is proposed. A comparison made with other methods demonstrates the superiority of this method. This method is also called decision fusion. In the context of a neural network, these outstanding results on tasks like land-use and land-cover representations are generated by the convolutional layers classification (LULC) [1] [2], mineral exploration [3] [4] and fused gradually to form a shared representation [5], urban planning [6], biodiversity conservation [7], sentiment layer. In Fusion methods can be classified into two groups: concatenation and alignment-based methods. Personal use of this material is permitted. To increase the interpretability learn spatial information by using a structured morphological of fusion models, Hong et al. [27] proposed a element of predefined size and shape. They proposed a graphbased shared and specific feature learning (S2FL) that is capable of model to couple the dimension reduction and fusion of decomposing data into modality-shared and modality-specific information. However, using this method, the cloud-covered components, which enables a better information blending of regions are not accurately classified because the morphological multiple heterogeneous modalities.
- North America > United States > Florida > Alachua County > Gainesville (0.14)
- North America > United States > Mississippi > Harrison County > Gulfport (0.04)
- North America > United States > Alaska > Denali Borough > Healy (0.04)
- Asia > Middle East > Syria > Daraa Governorate > Dar'a (0.04)
PEER: A Collaborative Language Model
Schick, Timo, Dwivedi-Yu, Jane, Jiang, Zhengbao, Petroni, Fabio, Lewis, Patrick, Izacard, Gautier, You, Qingfei, Nalmpantis, Christoforos, Grave, Edouard, Riedel, Sebastian
Textual content is often the output of a collaborative writing process: We start with an initial draft, ask for suggestions, and repeatedly make changes. Agnostic of this process, today's language models are trained to generate only the final result. As a consequence, they lack several abilities crucial for collaborative writing: They are unable to update existing texts, difficult to control and incapable of verbally planning or explaining their actions. To address these shortcomings, we introduce PEER, a collaborative language model that is trained to imitate the entire writing process itself: PEER can write drafts, add suggestions, propose edits and provide explanations for its actions. Crucially, we train multiple instances of PEER able to infill various parts of the writing process, enabling the use of self-training techniques for increasing the quality, amount and diversity of training data. This unlocks PEER's full potential by making it applicable in domains for which no edit histories are available and improving its ability to follow instructions, to write useful comments, and to explain its actions. We show that PEER achieves strong performance across various domains and editing tasks.
- North America > United States > California > Los Angeles County > Inglewood (0.28)
- North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
- North America > United States > Mississippi > Harrison County > Gulfport (0.04)
- (16 more...)
Robust Semi-Supervised Classification using GANs with Self-Organizing Maps
Fick, Ronald, Gader, Paul, Zare, Alina
Generative adversarial networks (GANs) have shown tremendous promise in learning to generate data and effective at aiding semi-supervised classification. However, to this point, semi-supervised GAN methods make the assumption that the unlabeled data set contains only samples of the joint distribution of the classes of interest, referred to as inliers. Consequently, when presented with a sample from other distributions, referred to as outliers, GANs perform poorly at determining that it is not qualified to make a decision on the sample. The problem of discriminating outliers from inliers while maintaining classification accuracy is referred to here as the DOIC problem. In this work, we describe an architecture that combines self-organizing maps (SOMs) with SS-GANS with the goal of mitigating the DOIC problem and experimental results indicating that the architecture achieves the goal. Multiple experiments were conducted on hyperspectral image data sets. The SS-GANS performed slightly better than supervised GANS on classification problems with and without the SOM. Incorporating the SOMs into the SS-GANs and the supervised GANS led to substantially mitigation of the DOIC problem when compared to SS-GANS and GANs without the SOMs. Furthermore, the SS-GANS performed much better than GANS on the DOIC problem, even without the SOMs.
- North America > United States > Florida > Alachua County > Gainesville (0.14)
- North America > United States > Mississippi > Harrison County > Gulfport (0.04)
- North America > United States > California (0.04)
- (2 more...)
- Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.88)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.36)
'A.I., Captain': The Robotic Navy Ship of the Future
The swells in the middle of the North Pacific were reaching nine feet when one of two engines on the diesel-powered U.S. naval ship called Sea Hunter shut down. About 1,500 nautical miles from its home base in San Diego, the 132-foot-long craft, which had been cruising at 10 knots, couldn't send a member of its crew to check out the problem--because it didn't have a crew. Sea Hunter's sleek, spiderlike silhouette, with a narrow hull and two outriggers, is a prototype of what could be a new class of autonomous warships for the U.S. Navy. Its artificial intelligence–based controls and navigation system, designed by Leidos Holdings, a defense contractor based in Reston, Va., were seven years in the making. And this maiden voyage--a more than 4,000-mile roundtrip to the giant Pearl Harbor naval station--was its first major proof of concept. Nothing like this had ever been attempted before.
- North America > United States > California > San Diego County > San Diego (0.25)
- North America > United States > Virginia > Fairfax County > Reston (0.24)
- Asia > Russia (0.14)
- (12 more...)
- Transportation > Air (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
- Government > Military > Navy (1.00)
Health Catalyst, Regenstrief partner to commercialize natural language processing technology
Health Catalyst and the Regenstrief Institute are working together to commercialize nDepth, Regenstrief's natural language processing technology. Indianapolis-based Regenstrief developed the technology to harness unstructured data. Salt-Lake City-based Health Catalyst, a data warehousing and analytics company, has been in the business of extracting data to boost care quality since it launched in 2008. It was developed within the Indiana Health Information Exchange, the largest and oldest HIE in the country. Regenstrief fine-tuned nDepth through extensive and repeated use, searching more than 230 million text records from more than 17 million patients.
- North America > United States > Utah > Salt Lake County > Salt Lake City (0.27)
- North America > United States > Indiana > Marion County > Indianapolis (0.27)
- North America > United States > Mississippi > Harrison County > Gulfport (0.07)
- Materials > Chemicals > Specialty Chemicals (0.96)
- Health & Medicine > Health Care Providers & Services (0.61)
DARPA's Self-Driving Submarine Hunter Steers Like a Human
Today is christening day for DARPA's Sea Hunter, a full-scale prototype of an autonomous surface vessel that's designed to be able to launch from a pier and go out on its own for weeks or months at a time, for thousands of miles at a stretch. The 132-foot-long, diesel-powered vessel was built by U.S. defense contractor Leidos under DARPA's ACTUV program, a somewhat clunky nested acronym that stands for Anti-Submarine Warfare (ASW) Continuous Trail Unmanned Vessel. The ship, now a joint project with the U.S. Office of Naval Research, was originally conceived as a tracker of stealthy diesel-electric submarines, but it's a flexible platform. "What we've kind of realized over the course of the program is that it's a truck," program manager Scott Littlefield tells IEEE Spectrum. "It's got lots of payload capacity for a variety of different missions."
- North America > United States > California > San Diego County > San Diego (0.07)
- North America > United States > Oregon > Multnomah County > Portland (0.05)
- North America > United States > Mississippi > Harrison County > Gulfport (0.05)
- Government > Regional Government > North America Government > United States Government (1.00)
- Government > Military (1.00)